Querying Multiple Bioinformatics Information Sources:

نویسندگان

  • David Buttler
  • Matthew Coleman
  • Terence Critchlow
  • Renato Fileto
  • Wei Han
  • Ling Liu
  • Calton Pu
  • Daniel Rocco
  • Li Xiong
چکیده

Advances in Semantic Web and Ontologies have pushed the role of semantics to a new frontier: Semantic Composition of Web Services. A good example of such compositions is the querying of multiple bioinformatics data sources. Supporting effective querying over a large collection of bioinformatics data sources presents a number of unique challenges. First, queries over bioinformatics data sources are often complex associative queries over multiple Web documents. Most associations are defined by string matching of textual fragments in two documents. Second, most of the queries required by Genomics researchers involve complex data extraction, and sophisticated workflows that implement the complex associative access. Third but not the least, complex Genomics-specific queries are often reused many times by other Genomics researchers, either directly or through some refinements, and are considered as a part of the research results by Genomics researchers. In this short article we present a list of challenging issues in supporting effective querying over bioinformatics data sources and illustrate them through a selection of representative search scenarios provided by biologists. We end the article with a discussion on how the state-of-art research and technological development in Semantic Web, Ontology, Internet Data Management, and Internet Computing Systems can help addressing these issues.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help?

Advances in Semantic Web and Ontologies have pushed the role of semantics to a new frontier: Semantic Composition of Web Services. A good example of such compositions is the querying of multiple bioinformatics data sources. Supporting effective querying over a large collection of bioinformatics data sources presents a number of unique challenges. First, queries over bioinformatics data sources ...

متن کامل

BioGuideSRS: querying multiple sources with a user-centric perspective

UNLABELLED Biologists are frequently faced with the problem of integrating information from multiple heterogeneous sources with their own experimental data. Given the large number of public sources, it is difficult to choose which sources to integrate without assistance. When doing this manually, biologists differ in their preferences concerning the sources to be queried as well as the strategi...

متن کامل

Preferences for Queries in a Mediator Approach

The problem of integrating relevant information obtained from multiple heterogeneous sources is a complex task, with which biologists are now faced. In this paper, we address the problem of querying biomedical databases in a mediator context. We propose to exploit the metadata of the sources to take into account user preferences. The mediator system we present is designed within a tractable log...

متن کامل

A User-Centric Framework for Accessing Biological Sources and Tools

Biologists face two problems in interpreting their experiments: the integration of their data with information from multiple heterogeneous sources and data analysis with bioinformatics tools. It is difficult for scientists to choose between the numerous sources and tools without assistance. Following a thorough analysis of scientists’ needs during the querying process, we found that biologists ...

متن کامل

Enhanced Bioinformatics Data Modeling Concepts and Their Use in Querying and Integration

ENHANCED BIOINFORMATICS DATA MODELING CONCEPTS AND THEIR USE IN QUERYING AND INTEGRATION FENG JI, Ph.D. The University of Texas at Arlington, 2008 Supervising Professor: Ramez Elmasri In bioinformatics research, scientists usually face the problems of modeling complex data types and integrating diverse resources. Traditional data models such as EER lack the expressing power to capture many char...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002